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ABSTRACT. The construction of a dendogram on a set of individuals is a key component 
of a genomewide association study. However even with modern sequencing technologies 
the distances on the individuals required for the construction of such a structure may not 
always be reliable making it tempting to exclude them from an analysis. This, in turn, 
results in an input set for dendogram construction that consists of only partial distance in- 
formation. By formalizing a dendogram in terms of an edge-weighted, rooted phylogenetic 
tree on a pre-given finite set X with | X \ > 3 whose edge- weighting is equidistant and a set 
of partial distances on X in terms of a set JS? of 2-subsets of X, we investigate the problem 
of when such a tree is lassoed, that is, uniquely determined by the elements in S£ . For this 
we consider four different formalizations of the idea of "uniquely determining" giving rise 
to four distinct types of lassos. We present characterizations for all of them in terms of 
the child-edge graphs of the interior vertices of such a tree. Our characterizations imply 
in particular that in case the tree in question is binary then all four types of lasso must 
coincide. 
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1. Introduction 

Years of selective breeding have resulted in large numbers of different varieties for, 
for example, oilseed rape and rice and also numerous animal breeds including dogs and 
chicken. Genomewide association studies constitute a powerful tool to try and link the ob- 
served phenotypic variability between the varieties (we will collectively refer to a variety 
and a breed as a variety) with variations in the genomes of the variety. A key component of 
such a study is phenetic clustering whereby one aims to construct a dendogram for a set of 
individuals from within a variety of interest indicating levels of similarity between them. 
Using some sort of distance measure this similarity can be based on e. g. morphological 
traits such as grain type or Single Nucleotide Polymo rphism (SNP) marke r s obtained 



through next generation se quencing technology (see e. g. lHarper et all din press!) : lMuir et al 
d2008b : IZhanget all d2010t) l or examples of such studies). 



From a formal point of view, a dendogram can be thought of as a pair (T, co) consisting 
of (i) a rooted tree T with leaf set a given non-empty finite set X (e. g. individuals), no 
degree two vertices but possibly a distinguished vertex pj of T called the root of T, and all 
other non-leaf vertices of T of degree at least three (we will refer to such a tree simply as an 
X-tree), and (ii) an edge- weighting CO : E(T) — > R>o for T that is equidistant which means 
that the induced distance Dry a -\(pT,x) from pj to every leaf x € X of T is the same (as 
with all relevant concepts, we refer the reader to the next section for a precise definition). 
With and without the equidistance requirement such pairs (T, co) have generated a lot of 
interest in the literature and so it is not surprising that numerous deep results for them are 
known provided the distance information from which to construct T and CO is complete 
in t he sense that for al l elements x and y in X th e distance between x and y is given (see 



e. g. lDress et all d2012ah : lserriple and Steel (12003b '). 
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However even for data generated with modern sequencing technologies the required 
distance measures need not always be reliable (or may simply be missing) resulting in 
only partial distance information for dendogram reconstruction. From the perspective of 
the aforementioned formalization of such a structure, the problem thus becomes (a) how to 
construct an equidistant X-tree (i. e. an X-tree with equidistant edge weighting) from partial 
distance information and, (b) if such a tree exists, when is it lassoed by that information, 
that is, unique in the sense that any two equidistant X-trees for which, with regards to the 
known distances on X, the induced distances on X coincide must be equivalent? 

Although approache s for tackl i ng the first problem exist in the form of, for example, an 
approach introduced in De S oetej (1984) not much is known about the second. A notable 
exception is a study in iDress et a ll d2012bh carried out for the unrooted analogon of an 
equidistant X-tree. Viewing partial distance information on a set X as a set of cords, that is, 
subsets of X of size two, the authors considered the following four natural interpretations 
of the above uniqueness problem. Namely, given a set Jz? C and an edge-weighted 
unrooted phylogenetic tree T on X when 

(i) is the edge-weighting of T uniquely determined by Jz? ?, 

(ii) is the shape i. e. the topology of T uniquely determined by Jzf ?, 

(iii) are both the edge-weighting and the topology of T uniquely determined by Jzf?, 
and 

(iv) when is the topology of T uniquely determined by Jzf up to T being obtained from 
another X-tree by collapsing edges?. 

Formalized as Jzf being an edge-weight/topol ogical/strong/weak lasso for an unrooted 
phylogenetic tree with leaf set X, the authors of Dress et a3 ( 2012b ) showed that all four 
concepts are distinct. Also, they presented results that allowed them to not only investigate 
the above types of lasso from a recursive point of view but also characterize under what 
circumstances a specifically constructed set of cords is a topological lasso (see Section [8] 
for more on this). However a characterization for the general case eluded them. 

Replacing the concept of an edge-weight lasso by that of an equidistant lasso to reflect 
the fact that for the edge-weighted X-trees of interest here the induced distance from the 
root to any leaf of such a tree is the same, we show that for X-trees the situation changes. 
More precisely, we present for an X-tree T characterizations for when a set Jzf C is 
weak lasso for T (Theorem 15. It . an equidistant lasso for T (Theorem l6.U , and for when 
it is a topological lasso for T (Theorem 17. j} in terms of the child edge graph Gr(Jzf,v) 
that can be canonically associated to every non-leaf vertex v of T via its child edges. 
Our characterizations can be thought of as a spectrum on the connectedness of that graph 
with the extreme situations being an equidistant lasso and a topological lasso. They im- 
ply that every topological lasso and every non-empty weak lasso must be an edge-weight 
lasso ((Corollaries 17.21 and I6.21 i and that in case T is binary the notions of an equidis- 
tant lasso and a topological lasso (and thus a weak lasso) coincide. Consequently, every 
edge-weight/topological lasso is also a strong lasso in that case (C orollary 17. 21. We a lso 
investigate two special types of sets of cords originally introduced in Dress et aj ( 2012b ) in 
the light of our findings above. This investigation shows in particular that it is possible for 
the concepts of an equidistant and a topological lasso to coincide without the X-tree they 
are referring to being binary. 

The outline of the paper is as follows. In the next section, we introduce relevant ter- 
minology. In Section [3] we present first characterizations for when a set of cords is a 
topological/equidistant/weak lasso for an X-tree (Theorem 13.21 . In Section |4] we intro- 
duce the child-edge graph associated to a non-leaf vertex of an X-tree and present first 
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properties of it concerning coralling sets of cords. In Section|5j we establish Theorem l5.ll 
In Section|6] we show Theorem l6.1l and in Section|7] we prove Theorem l7.ll In Section[SJ 
we present two general ways for constructing for an X-tree T two different sets of cords of 
(2) and discuss their properties in in the context of lassoing and coralling T. We conclude 
with Section [9] where we also present some open problems. 



2. Preliminaries 

Assume from now on that X is always a finite set with at least 3 elements. For a cord of 
X with elements a and b we write ab rather than {a,b}. 

Suppose for the following that T is an X-tree. Then we call a vertex of T that is not 
a leaf of T an interior vertex of T, an edge e that is incident with a leaf of T a pendant 
edge of T, and an edge of T that is incident with two interior vertices of T an interior 
edge of T. We denote the set of all interior vertices of T by V°(T) and the root of T by 
Pt- Let T' be a further X-tree. Then we say that T and T' are equivalent if there exists 
a bijection : V(T) — > V(T') that extends to a graph isomorphism between T and T' that 
is the identity on X and maps the root pj of T to the root p T > of T'. Also, we say that T' 
refin es T if, up to equivalence , T can be obtained from T' by collapsing edges of T' (see 



e. g. ISemple and Steell d2003l) ). In that case, we will also call T' a refinement of T, Note 
that T is its own refinement if and only if T is binary, that is, every interior vertex of T has 
degree three but the root which has degree two. 

Let to denote an edge-weighting for T, that is, a map to : E(T) — > R>o- Then we denote 
by {T,Co) the edge-weighted X-tree whose underlying X-tree and edge-weighting are T 
and CO, respectively, and by D(r £0 ) the distance induced by CO on V(T). We call CO proper 
if Co{e) > holds for all interior edges e of T and we call it equidistant if 

(El) O (r>£0) (x,pr) =-D(7»(y,Pr), for all*,? eX, and 

(E2) D( T a j (x, u) > Dfj m ^ (jc, v), for all x G X and any m, v 6 V(r) such that m is encoun- 
tered before v on the path from pj to x. 

Note that Property (E2) implies that, for all interior vertices v of T and all leaves ij£X 
of T for which v lies on the path from x to y, we have D( r £0 \(x,v) = D/ Ta) \(y,v). Also 
note tha t our definition of an equ idistant edge-weighting is slightly different from the one 
given inlSemple and Steell (12003b in so far that co is a map into M>o and not into R, as in 



Semple and Steell (120031) . 

Suppose _Sf C (2) is a set of cords. If T' is a further X-tree and o and co' are proper 
edge-weightings for T and T', respectively, we say that (T, co) and (T', CO 1 ) are _§f -isometric 
if £>(r ffl )(- 1 i;,} ; ) = £ , (r',(i)')(- ; c,3 ; ) hold s for all cords xy £ ££ . Canonically extending the cor- 



responding concepts introduced in Dress et al (2012bJ) for unrooted phylogenetic trees on 
X to X-trees, we say that is 

(i) an equidistant lasso for T if, for all equidistant, proper edge-weightings CO and co' 
of T, we have that co = co' holds whenever (T, co) and (T, to') are S£ -isometric 

(ii) a topological lasso for T if, for every X-tree T' and any equidistant, proper edge- 
weightings to of T and to' of T', respectively, we have that T and T' are equivalent 
whenever (T,to) and (7*' , co') are «5f -isometric. 

(iii) is a strong lasso for T if _Sf is simultaneously an equidistant and a topological 
lasso for 7*. 

(iv) a weafc tosso /or T if, for every X-tree T' and any equidistant, proper edge- 
weightings to of r and to' of 7*', respectively, we have that T is refined by T' 
whenever (T,to) and (T',to') are «5f -isometric. 
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Also, we say that a set Jz? C ( 2 ) of cords is an equidistant/topological/weak/strong lasso 
on X if there exists an X-tree T such that _§f is an equidistant/topological/weak/strong lasso 
for T. For the convenience of the reader, we illustrate the above types of lassos in Fig.Q] 
forX = {a,b,c,d,e}. 

A P 



a b c d e 

Figure 1. For X = {a,b,c,d,e}, the set _Sf = {ab, cd.de] is an 
equidistant lasso for T, the set Jz? = {ab,ac,bc,bd,de} is a topological 
lasso for T and also a strong lasso for T, and the set Jz? = {a£>, bc,cd,de} 
is a weak lasso for T. 



Note that we will also say that a set Jz? of cords of X coralls an X-tree r if Jz? is a weak 
lasso for T, that a topological lasso for T is in particular a weak lasso for T, and that the 
notions of a topological lasso for T and a weak lasso for T coincide if T is binary. Also 
note that for Jz? to be a topological/equidistant lasso we must have that Jz? ^ 0. However 
Jz? ^ need not hold for Jz? to corall an X-tree as every subset of ( 2 ) including the empty- 
set coralls the star-tree on X, that is the tree with a unique interior vertex w and leaf set 
X. 



3. A FIRST CHARACTERIZATION OF A TOPOLOGICAL/WEAK/EQUIDISTANT LASSO 

In this section we present a first characterization for when a set of cords of X is a 
topological/weak/equidistant lasso for an X-tree. To establish this characterization, we 
require further definitions and notations. 

Suppose for the following that T is an X-tree and that v 6 V"(T). Then we call an edge 
e of T incident with v that is not crossed by the path from the root pj of T to v a child edge 
of v. If e is incident with v but lies on the path from pj to v then we call it a parent edge of 
v. In the former case, we call the vertex incident with that edge but distinct from v a child 
of v and in the latter a parent of v. We call a vertex w of T distinct from v a descendant 
of v if there exists a path from v to w (possibly of length one) that crosses a child of v and 
denote the set of leaves of T that are also descendants of v by Lj ( v) . If v is a leaf of T, then 
we put Lt(v) := {v}. Also if there is no ambiguity as to which X-tree T we are referring 
then we will write L(v) rather than Lr(v). 

Suppose r is an X-tree. Then, for all x G X, we denote the edge of T incident with x 
by e x and the parent of x by v,. Moreover, we call a non-empty subset LC.X of leaves of 
T that all have the same parent v S V°(r) a pseudo-cherry of T if L = L(v). In that case, 
we also call v the parent of that pseudo-cherry. If {jq , . . . ,JCjt}, fe > 1, is a pseudo-cherry of 
some X-tree T then we will sometimes write x\,...,Xk rather than {x\ ,x^}. Note that 
every X-tree on three or more leaves that is not the star-tree on X must contain at least one 
pseudo-cherry. Also note that in case \L\ = 2 the definition of an pseu do-cherry reduces to 



that of a cherry in the usual sense (see e.g. iSemple and Steell (120031) ). In the special case 



that |X| =3, say X = {a,b,c}, and that T has a cherry, a,b say, we call T a triplet and 
denote T by ab\c (or, equivalently, c\ab). 
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Now assume again that T is an arbitrary X-tree. For any non-empty subset Y C X, we 
denote by T\y the minimum spanning tree in T that connects the leaves in Y (where we 
suppress resulting interior vertices v of Ty with a single child, that is, if the parent of v is w 
and u is the unique child of v then we delete v plus its incident edges from T\y and add the 
edge {w, u} to E(T\y)). Note that Ty is clearly a F-tree. We say for three pairwise distinct 
elements a,b,c G X that T displays the triplet ab\c if 7*|{ flj j jC } is equivalent with ab\c. We 
denote by &(T) the set of triplets displayed by T. As is well-known, any X-tree T can 
display at most ('^') triplets with equality holding if and only if T is binary. Furthermore, 
any X-tree T is uniquely determined by the set M(T) in the sense that if T' is a further 



X-tree and M(T) — & (T') holds then T and T ' must be equivalent (see e. g. dDress et all 
2012al Chapter 9) and isemple and Steel 120031 Section 6.4)). 



Observe that if T is an X-tree, CO is an equidistant, proper edge-weighting for T, and 
a,a',b G X are pairwise distinct elements then either all three pairwise distances induced 
onZ := {a,a* ,b} must coincide or two of the distances induced by it must coincide and the 
third one must be strictly less than that distance. Moreover, Dfj m ^{a,a') < D( T a> ^(a,b) = 
D(T,m){ a ' ',b) holds if and only if aa'\b £ffl(T) and we have D^ T m ^(a,a') = D(j^(a,b) = 
O(r,co) ( fl 'i b) if an d on ly if me Z-tree T\z is the star-tree on Z. 

Fhe next result is fundamental for the paper. 

Lemma 3.1. Suppose that T and T' are two X-trees with equidistant, proper edge-weightings 
CO and CO 1 , respectively, and let a, a 1 ,b GX denote three pairwise distinct elements such that 
D(T,m)( a i a ') = D (T',m')( a i a ') and D^ T a ^(a,b) = Dfji m i^a,b). Then the following hold: 

(0 I f D (T,a,){ a -, a ') < D (T.<i>){ a -,b)= D (T,a,){ a 'ib) then 
(1) D(r',a>') (a, a 1 ) < D {r al) (a,b) =D (r e ,, ) (a / ,fc). 

In particular, Diji m i\(a! \b) = Dt T m ^(a' ,b). 

(ii) aa'\b G &(T) if and only if ad\b G M{T'). 

(iii) If D( T(0 \(a' ,b) = Di t i :(0 i\(a' ,b) then T\z is the star-tree on Z := {a,a',b} if and 
only ifT'\z is the star-tree on Z. 

Proof, (i): NotethatD( rj(B )(a,a') =D(j^ ca i\{a,a') combined with D( T (0 \ (a, b) = D^ T i m i)(a,b) 
implies that we cannot haveD^/ ^{a,a') =D( r ; j0) /)(a,fe) as this would imply thatD( r ^ (a, a 1 ) ■ 
D(T,w)( a 'b) which is impossible. But then our assumptions imply that Dr T i m n(a,a') = 
D(T',a>') b) cannot hold either. Inequality ([T} now follows from the observation preced- 
ing the statement of the lemma. In particular, this implies D/p jCO i)(a', b) =D(j-/ )(B /)(a,fo) = 
D(T,co)(a,b) = D {T(0] (a' ,b). 

(ii) & (iii): This is an immediate consequence of (i) and the observation preceding the 
statement of the lemma. □ 

To be able to state the next result we require a further definition. Suppose if C Q and 
T is an X-tree. Let x,y G X be two distinct leaves of T that are contained in the same a 
pseudo-cherry of T. Then we put 

J£?i(*,y):={a6ej£? : x <£ {a,b}} U {ay : axe 3?}. 

Note that «£? = if and only if S£\ (x,y) = 0. 

Theorem 3.2. Suppose that Jz? C ( 2 ) is a set of cords and that T is an X-tree. Let x^y G X 
denote two distinct leaves ofT that are contained in the same pseudo-cherry ofT. Then 
the following hold. 



(1 



KATHARINA T. HUBER AND ANDREI- ALIN POPESCU 



(i) Jz? is a topological lasso for T if and only if S£\ (x,y) U {xy} is a topological lasso 
for T and xy £ Jz?. 

(ii) Jz? is an equidistant lasso for T if and only if Jz?i(x,y) U {xy} is an equidistant 
lasso for T andxy £ I£. 

(ii) If Jz? 7^ then Jz? is a weak lasso for T if and only if ' Jz?i (x,y) U {xy} is a weak 
lasso for T andxy £ Jz?. 

Proof, (i): Put Jz?j + := Jz?i (x,y) U {xy} and assume first that Jz? is a topological lasso for T. 
Then Jz? ^ 0. Let T' denote an X-tree and CO and co' equidistant, proper edge-weightings 
for T and T', respectively, so that (T, co) and (T', co') are Jzf^-isometric. To see that T and 
T' are equivalent it clearly suffices to show that (T, co) and (T', co') are also «Sf -isometric, 
that is, 

(2) D {Tm) (a,b)=D {T , M) , ] (a 7 b) 
holds for all ab £ «Sf . 

Suppose ab £ Jz? . If x £" then £ Jz?j + and so Equation (0 holds as (T, o) and 

(T 1 , co') are Jz?^ -isometric. So assume thatx £ {«,£>}, say x = a. lfy = b then ab = xy £ jSf 
and so Equation (f2]i holds by the same argument. If y ^ b then xb = ab £ and so 
yb€J&*. Since xy £ Jz?* also holds, we haveZ)( ri£0 )(x,fc) = Dm i<B ')(x,fo) by Lemma [3TT1 
and so Equation (O follows in this case, too. 

Conversely, suppose that Jz?j + is a topological lasso for r and that xy £ Jz? . Then Jz?^ ^ 
0. Assume that T' is an X-tree and that co and co' are equidistant, proper edge-weightings 
for T and T', respectively, so that (T, co) and (T' : co') are _Sf -isometric. To see that T and 
T' are equivalent it clearly suffices to show that (T, co) and (T 1 , co') are also Jz?[ + -isometric, 
that is, that 

(3) D (Tm) (a,b) =D {T , (0 , ] (a 7 b) 

holds for all ab £ Jz? t + . 

Suppose ab £ Jz?[ + . If x £ {a,b} then ab = xy £ Jz? and so Equation (O holds as (T, o) 
and (7*', co') are Jz? -isometric. So assume that x ^ {a,b}. Then a£> £ 5£\ (x,y). If y {a,£>} 
holds too then ab £ Jz? and so Equation (O holds by the same argument. So assume that 
y £ {a,b}, say, y = a. Then yZ? = ab £ Jzf^ and so one of yb £ Jz? and xb £ Jz? must hold 
by the definition of Jz?j + . If the former holds then ab = yb £ Jz? and so Equation (f3]i holds 
by assumption on (T,eo) and (T',co'). If x^ £ Jz? then Dt T ^ m \[y,b) = D( T i^(y,b) follows 
by Lemma lTTl since, by assumption, xy £ Jz?. But then Equation (01 holds in this case, too. 

(ii) & (iii): This follows using similar arguments as in the proof of (i). □ 

4. The child-edge graph 

In this section we first introduce the child-edge graph Gj ( Jz?, v) associated to an interior 
vertex v of an X-tree T and a non-empty set Jz? C of cords and then study some of its 
properties with regards to coralling an X-tree. We start with a definition. 

Suppose T is an X-tree, v £ V°(T), and Jz? C ( 2 ) is a non-empty set of cords. Then 
we call the graph Gr(Jz?,v) = (Vt. v ,Et. v ) with vertex set Vt.v the set of all child edges of 
v and edge set £Y v the set of all {e,e'} £ ( V ^ v ) for which there exist leaves a,b £X such 
that e and e' are edges on the path from a to b in T and ab £ Jz? the child-edge graph (ofv 
with respect to T and Jz?). Note that in case there is no danger of ambiguity with regards 
to the X-tree T we are referring to, we will write G(Jz?, v) rather than Gt (Jz?, v) and V v and 
E v rather than Vt.v and Et, v . 
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For T an X-tree, the next result provides a key insight into the structure of G(„Sf,v), 
v G V°(T), whenever _Sf coralls 7\ To state it, we denote by V°{T)~ the set of all interior 
vertices of T that are not a parent of a pseudo-cherry of 7\ 

Lemma 4.1. Suppose that T is an X-tree but not the star-tree on X, that Jz? C (^) is a 
non-empty set of cords that coralls T, and that v G V°(T). Then the following holds. 

(i) If v G V°(T)~ and e G V v is a child edge ofv that is not incident with a leaf of T 
then {e,e'} G E v , for all e' G V v that are incident with a leaf of T. In particular, 
G(_Sf,v) is connected. 

(ii) Ifv G V"(T) - V°{T)- then G(_Sf , v) is connected. 

Proof, (i): Let e G V v denote a child edge of v that is not incident with a leaf of T and 
let u G V(T) denote the child of v that is incident with e. Let CO : E(T) —> R>o be an 
equidistant, proper edge-weighting for T. Assume for contradiction that there exists a 
child u' G V(T) of v that is a leaf such that with e u i denoting the child edge of v incident 
with u', we have that {e,e u i} E v . Note that since co is equidistant and u' is a leaf of T 
whereas u is not, we must have Co(e u i) > (o(e). 

Assume first that \V V \ > 3. Let T' denote the X-tree obtained from T by deleting the 
edge e u i and adding the edge e* = {u,u'}. Clearly, T' is not a refinement of T. Consider 
the edge-weighting 



co' :E(T') ^R> : 




(o(e) else. 



Then it is easy to see that co' is equidistant and proper. Since, by construction, (T. co) and 
(T', CO 1 ) must be _Sf -isometric it follows that T' is a refinement of T as _Sf coralls T; a 
contradiction. 

Now assume that |V V | = 2. If v ^ pr then v must have a parent w G Let T' denote 

the X-tree obtained from T as before except that we now suppress v as this has rendered it a 
vertex with a single child. Let co' be the edge-weighting for T' as defined above except that 
we put co'({u,w}) = co({u, v}) + Co({v,w}). Then the same arguments as in the previous 
case yield a contradiction 

If v = then let T' denote the X-tree obtained from T by collapsing the edge {v, u}. 
Clearly, T' is not a refinement of T. Consider the edge-weighting co' : E(T') — > IR>o for 
T' defined by putting co' = Co\ E i T /y Then the same arguments as in the previous two cases 
yield a contradiction. 

(ii): Assume for contradiction that there exists some v G V"(T) —V"(T)~ such that 
G(Jzf ,v) is not connected. Then every child of v is a leaf of T and there exist vertices 
ei,e2 G V v distinct such that e\ and ei are not joined by a path in G{££ , v). Let G\ and G2 
denote the connected components of G(Jzf,v) containing e\ and ex, respectively. For all 
children u G V(T) of v, let e u denote the child edge of v incident with u. Note that since 
T is not the star-tree on X there must exist a vertex w G V(T) that is the parent of v. Let 
T' denote the X-tree obtained from T via the following process. Let i = 1,2. Then, for all 
iij G V(Gj), subdivide the edge e Uj by a new vertex p Uj not already contained in V(T). Next, 
identify all vertices p u . into the vertex p, and then delete all copies of the edges {v,p, } from 
T. Finally, add the edge {w,p t } and suppress Pi if |V(G,-)| = !• If V v = V(Gi) UV(G 2 ) 
then also suppress the vertex v. The resulting tree is T' and, in either case, T' is clearly not 
a refinement of T. 

Let co denote an equidistant, proper edge- weighting for T. Note that co(e\) = co(e) must 
hold for all e G V v , as v G V"(T) — V"(T)~ . For the following, assume first that neither p\ 
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nor p2 have been suppressed in the construction of T'. Consider the edge-weighting 

!G)(e) ifpi,p 2 ge, 
CO({v,w}) ifee{{w, Pl },{w,p 2 }} 
eo(ei) else. 

Then, by construction, co' is equidistant and proper and (T, co) and T', co') are Jzf -isometric. 
Since, by assumption, Jz? coralls T it follows that T' is a refinement of T; a contradiction. 

In case one of p\ and p2 or both of them have been suppressed in the construction of T' 
the definition of the edge-weighting co' for T' is similar to the one above thus leading to a 
contradiction in these cases too. □ 



The next result is a strengthening of Lemma BTTT i). To state it, we require further termi- 
nology concerning child-edge graphs. Suppose T is an X-tree, v £ V°(T)~, and ££ C (~) 
is a non-empty set of cords. Then we denote by £/(v) C E(T) the set of child edges of 
v that are incident with a leaf of T and by E s (v) C E(T) the set of child edges of v that 
are not contained in £/(v). Note that E s (v) is empty if and only if v is the parent of a 
pseudo-cherry of T. Also note that £y(v) = might hold. Clearly, if neither of them is the 
empty-set then {E/(v),E s (v)} is a partition of V v . For v 6 V°(T)~, we say that G(Jzf ,v) 
is rich if the subgraph Gr(_Sf, v). s of Gr(-Sf,v) induced by E,s(v) is a clique, and, in case 
£;(v) ^ 0, we have for all e £ £/(v) and all e' £ £y(v) that {e,e'} £ £ v . As before we will 
write G(££ , v) s rather than Gj (Jzf , v) s if there is no ambiguity with regards to which X-tee 
T we are referring. Note that with T and v as above, if £/(v) = then G(Jzf,v) is rich if 
and only if G(j2f,v) s is a clique. 

Proposition 4.2. Suppose that T is an X-tree but not the star-tree and that Jz? C 

is a non-empty set of cords. If ^£ coralls T then G(Jz?,v) must be rich, for all vertices 

veV"{T)-. 

Proof. Let co denote an equidistant, proper edge-weighting for T and assume for contra- 
diction that there exists a vertex v £ V"(T)~ such that G(^f , v) is not rich. We first show 
that G(Jzf,v) s must be a clique. Suppose G(J?,v) s is not a clique, that is, there exist dis- 
tinct child edges e and e' of v contained in V(G(J?,v) s ) such that with v e and \y denoting 
the children of v incident with e and e', respectively, we have, for all a £ L{y e ) and all 
b £ L(vy) that ab g if. Note that |V V | > 3 must hold. Indeed, if |V V | = 2 then V v = {e,e'} 
and so {e,e'} £ E v since, by Lemma |4~T1 G(if,v) is connected. But then G{££ \v) is a 
clique and so G(J*f , v) s is a clique; a contradiction. Without loss of generality assume that 
©(e) > ffl(e'). 

If o(e) > Co(e') then consider the X-tree T' obtained from T by deleting the edge {v, v e } 
and attaching v e to vy via the edge e* = {v ei v e i}. Clearly, T' is not a refinement of T. 
Consider the edge-weighting co 1 for T' defined by putting 



0)':£(r')^R>o: /' 



(0(f) iff^e* 
C0(e)- co(e') else. 



Clearly, co' is equidistant and proper and, by construction, (T, co) and (T',co') are ££- 
isometric. Since if coralls T it follows that T' must be a refinement; a contradiction. 

If ft)(e) = co(e') then consider the X-tree T' obtained from T by first identifying the 
vertices v e and v e i (keeping the label v e ) and then deleting one of the edges from v to v e . 
Again, T' is clearly not a refinement of T. Consider the edge-weighting co' : E(T') — > R>o 
defined as co' = Co\ E tjiy Then the same arguments as before imply that T' is a refinement 
of T; a contradiction. Thus, G(Jzf , v) s is a clique, as required. 
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Now if E[(v) = then G(if,v) must be rich. So assume that £/(v) ^ 0. But then 
Lemma FkTT i). implies that G(if , v) must be rich. □ 

We conclude this section with a result that will be useful for establishing the aforemen- 
tioned characterization of weak lassos in terms of child-edge graphs (Theorem 15. U . Its 
proof relies on the well-known f act that an X-tee T' is a refinement of an X-tree T if and 
only ifM(T) C&(T') (see e.g. dSemple and SteelL 120031 Theorem 6.4.1)). 



Lemma 4.3. Suppose that T is an X-tree that has a unique cherry x,y and that if C ( 2 ) 
is a set of cords that contains the set {xy} U {az '■ Z G X — {x,y} and a = x or a = y}. Then 
if coralls T. 

Proof. By Theorem l3.2l it suffices to show that if' := if (x,y) U {xy} coralls T. Suppose 
there exists an X-tree T' and equidistant, proper edge-weightings to and to' of T and T', 
respectively, such that (T, to) and (T', to') are if' -isometric. To see that T' is a refinement 
of T it suffices to show that xy\z G &{T') holds for all z G X - {x,y}. Let z EX — {x,y}. 
Then zy G if'. Combined with the facts that xy G if' and that xy\z G ^(T) it follows, by 
Lemma |3~T1 that xy\z G M(T'), as required. □ 

5. A CHARACTERIZATION OF A WEAK LASSO 

In this section, we characterize sets of cords of X that corall an X-tree T in terms of 
two properties on the child edge-graphs associated to the interior vertex of T . In addition, 
we present a simple example that illustrates that this characterization does not hold if the 
equidistant requirement for the two edge-weight functions mentioned in the definition of 
such a lasso is dropped. 

Theorem 5.1. Suppose that T is an X-tree but not the star-tree and that if C (^) is a 
non-empty set of cords. Then if is a weak lasso for T if and only if the following two 
properties hold 

(CI) G(if,v) is rich, for all v G V (T)~ 

(C2) G(if,v) is connected, for all v G V"(T) - V°(T)-. 

Proof. Assume first that if coralls T. Then Properties (CI) and (C2) must hold by Propo- 
sition |4T2] and Lemma |4~TT ii). 

To see the converse, assume that Properties (CI) and (C2) hold. We prove that if must 
be a weak lasso for T by induction on the size « of X. Note that the statement clearly holds 
in case « = 3 as then T is binary and a refinement of a binary X-tree is the tree itself. So 
assume that n = 4. Then since T is not the star-tree on X, we have 2 < |V°(r)| < 3. If 
|y°(r) | = 3 then T is binary and so the statement holds. So assume that \V° (T)\ = 2. Then 
T has a unique cherry x,y and so, by Lemma l4~3l if coralls T in this case too. 

Assume that the statement holds for all finite sets of size n > 4 and let X denote a set 
of size n + 1. Let T be an X-tree that is not the star-tree on X and let if C ( 2 ) denote a 
non-empty set of cords such that Properties (CI) and (C2) are satisfied for T and if. Note 
that T must contain at least one pseudo-cherry. To see that if coralls T, let T' denote an 
X-tree and to and to' equidistant, proper edge-weightings for T and T' , respectively, such 
that (T, to) and (T', to') are if-isometric. We distinguish between the cases that (i) every 
pseudo-cherry of T is in fact a cherry of T and (ii) that T contains a pseudo-cherry that has 
at least three leaves. 

Case (i): Assume that every pseudo-cherry of T is a cherry and let x,y G X such that x,y 
is a cherry of T. Note that since « > 5 and T is not the star-tree on X, there must exist a 
vertex w G V(T) that is the parent of v x (which is itself the parent of the cherry x,y). Put 
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X\ =X — {x} and «5fi = Ji?i(x,y). Note that since G(Jz?,v v ) is connected, it immediately 
follows that xy G J?f and that since G(Jz? ,w) is rich we must have Jz?i ^ 0. Let T\ denote 
the X\ -tree obtained from T by deleting x and its incident edge and suppressing v x as this 
has rendered it a vertex with a single child. Let 



Clearly, ©i is equidistant and proper and T\ is either the star-tree on X\ or not. Assume 
first that T{ is the star- tree on X\. Then x,y is the unique cherry of T and all elements 
z G X{ — {y} are adjacent with the root pj of T which is w. But then Properties (CI) and 
(C2) combined with Lemma l-Ol implv that Jzf coralls T. 

So assume that T\ is not the star-tree on X\. We claim that Properties (CI) and (C2) 
hold for T\ and We start with establishing Property (CI). Assume for contradiction 
that there exists some u G V°(T\)~ such that G^i^fiju) is not rich. We first show that 
G^, (J^i,u) s must be a clique. Assume for contradiction that G^, (J*?i,u) s is not a clique, 
that is, there exist vertices e and e' in Gt { {J£\,u) s such that for all a € («e) and all 
£> G Ljj we have that ab JO\ where u e and u e i denote the children of u in T\ incident 
with e and e' ', respectively. Note that V"(T\) U {v Y } = V"(T) and that u = w must hold. 
Indeed assume for contradiction that u ^ w. Then v x is not a child of u in T and so u e and 
m c / are also children of u in 7\ Since Gt(£?,u)s is a clique by Property (CI), there must 
exist a <E Lj{u e ) and b 6 Lr{u e i) such that a£> e «Sf. If x then, by the definition 

of Jzfi, we have ab € a contradiction. Thus, x e {a,fe}. Without loss of generality 
assume that x = a. Then y S Lj{u e ) and, again by the definition of Jz?i, we obtain y£> G i ; 
a contradiction. Thus m = w, as required. But then y ^ {u e ,u e i} and so one of e and e' is not 
a vertex in Gj, (j§fi , u) s \ a contradiction. Thus, Gt { (-S^i , «)i must be a clique, as required. 

Since, by assumption, Gt { (=Sfi,w) is not rich, there must therefore exist a leaf z of T\ 
with e' = G E(T\) holding and some vertex e in Gt { (J^i,u) s such that {e,e'} is not 

an edge in Gj { (J?i,u). Let u e denote the child of u in T\ incident with e. If u ^ w then 
since the children of u in T are precisely the children of u in T\ and, by Property (CI), 
Gj(J£ \u) is rich we obtain a contradiction. Thus, u = w. But then y = z must hold. Since 
Gt(£?, w) is rich there must exist some a G {x,y} and some G L(u e ) such that afe G J£ . 
But then yfe G and so {e,e'} is an edge in G^, (^fi,«), a contradiction. 

We next establish that Property (C2) is satisfied by 7\ and which will conclude the 
proof of the claim. Let u G V"(T\) — V"(T\)~ . Then u must be the parent of a pseudo- 
cherry of T\. If m ^ w then since, by assumption, every pseudo-cherry of T is a cherry 
of r it follows that u is the parent of a cherry of T\. But then Gr, (J£i,u) is connected as 
Gr, (^fi , u) = Gt(£?, u) and Property (C2) is satisfied by T and Jz? . So assume that m = w. 
Then m is the parent of v x in T and all children of u in T but are leaves of T. Since, 
by Property (CI), Gr(^f ,u) is rich, there exists for all children z of u that are leaves of T 
some b z G {x,y} such that b z z G . But then yz G ^fi for all such children z of w and thus 
Gt^-JS^m) is connected, as required. Thus Property (C2) is also satisfied by T\ and 
which completes the proof of the claim. By induction, it follows that is a weak lasso 
for 71 . 

Let T( denote the Xi-tree obtained from T' by deleting x and its incident edge and 
suppressing the parent vertex of x in T( if this has rendered it a vertex with a single child. 
Let (o[ denote the edge-weighting of T[ that is canonically induced by ©i on the edges 
of T[. Then, by Lemma [3T| combined with the assumption that (T,co) and (T',co') are 
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_£? -isometric, it follows that (7i,G)i) and (T(,a>[) are Jzfi-isometric. Since is a weak 
lasso for T\ this implies that T[ is a refinement for T\ . 

To establish that _Sf coralls 7" it now suffices to show that xy\a £ ffl{T') holds for all 
a £ £(w) — To this end, note that since Property (CI) is satisfied by T and ££, we 

have for all children a £ L(w) that are leaves of T that there exists some b £Lj (v x ) = {x,y} 
such that afe £ S£ . Combined with Lemma [37X1 and xy £ J*f, it follows that D(j )£0 ) = 
D( r / i(0 /)(x,a) = Di Ta \{y,a) = D^ T i a ^(y,a). Since xy|a £ we obtain xy|a £ &(T'), 

as required. This completes the proof of the induction step in this case. 

Case (ii): Assume that T contains a pseudo-cherry c of size three or more. Let v £ V°(T) 



denote the parent of c in T. Then, by Property (C2), G(Jz?,v) is connected. By dDiestel , 



Electronic Edition 20051 Proposition 1 .4. 1), we may choose some x £ X such that the graph 



G~(J2?, v) obtained from G(_£f,v) by deleting e x and its incident edges is connected. Note 
that since x is a leaf in c, we have v = v x . 

Put X\ := X — {x}, choose some y £ L(v x ) such that xy £ _Sf, and put «ifi := Jzf (x,y). 
Consider the X-tree T\ obtained from T by deleting x and its incident edge. We claim again 
that Gt { satisfies Properties (CI) and (C2) for all u £ V°(T\) as specified in those 

conditions. To see this, note first that since v x has at least two children in 7\, we have 
V°(Ti) = V°(T) and so also V°(Ti)- = V°(T)-. 

We start with establishing Property (CI). Assume for contradiction that there exists 
some vertex u £ V°(T\)~ such that Gt x {££\,u) is not rich. We again show first that 
Gr t (J^i,u) s is a clique. Assume for contradiction that this is not the case and let e, <?', 
u e , and Ugi be as in the corresponding situation in the previous case. Note that since 
u £ V"(Ti)- = V°{T)~ and v x £ V°(T) - V (T)~ we have v x ^ u. But then similar ar- 
guments as the ones used to show in Case (i) that w ^ u in the context of establishing 
that Grj (J?i,u) s is a clique yield a contradiction. Thus, G^, (Jz?i,u) s must be a clique, 
as required. As in the previous case, there must therefore exist some z £ X\ such that 
e = {z,u} £ E(T{) and some vertex e' in Gj s {££\,u) s such that {e,e'} is not an edge in 
Gjj (J£\ , u). Note that the same arguments as above imply that u ^ v x . By the definition of 
Jzfi, it follows that x must be the unique leaf in Lj(u e i) such that xz £ % '. Since y and x 
are leaves in the same pseudo-cherry of T, we obtain yz £ Consequently, {e,e'} is an 
edge in G^ (^fi,M), a contradiction. Thus, Property (CI) is satisfied by 7] and jSfi. 

To see that 7i and Jz?i satisfy Property (C2) assume without loss of generality that there 
exists some u £ V°(T) — V°(T)~ such that Gj { {££\ , u) is not connected. Then u ^ v x , by the 
choice of x. Since u is the parent of a pseudo-cherry in T it follows that m must be the parent 
of the same pseudo-cherry in T\ . But then G^ ( Jr?i , u) = Gj «) and so G^ (_Sfi , u) must 
be connected as, by Property (C2), Gj (Jiff, u) is connected; a contradiction. This concludes 
the proof of the claim. By induction, it follows that Jz?i coralls T\ . 

Let T[, w, and a>[ be as in the previous case. Then, as in that case, T[ must be a 
refinement of T\ . We claim that for all children u of w in T distinct from v x and all a £ Lj (h) 
we must have 

xy|a£^(T'). 

To see this, note first that Property (CI) implies for all such children u of w that there 
must exist some a u £ Lj(u) and some z Vr £ Lr(v^) such that a u z Vx £ jSf . Let u denote a 
child of w in 7" distinct from v x and put a~a u and z = z,, r . We show first that 

(4) D (T,co){a,y) = D (T',<o<){ a ,y) wdD (T,m)( a , x ) = D (T',a')( a > x )- 

Clearly, if z £ {x,y} then one of the two equations in must hold. Assume without loss of 
generality that z / x. Since Gt(^,v x ) is connected by Property (C2), it follows that there 
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exists a path z = Zi,Z2, ■ ■ ■ ,Zk =x, k > 2, fromz to x in Gr(Jzf , v x ). But then Lemma ITTl i) 
implies D( T a) (a,jc) =D( 7 -/ (B /)(fl,x) since, for all 1 < ;' < k— 1, we have ZiZi+t G Jzf. Ifz^y 
then similar arguments imply that Di TM ){a,y) = Di t i jC0 ')(a,y). If z = y then ya = za G Jzf 
and so D^ T ^(a,y) = D( T i a i^(a : y). Thus both equations in © must hold, as required. 
Combined with xy G Jz? (which holds by the choice of y) and the fact that xy\a G &(T) we 
obtain 

xy\a G^(r') 

in view of Lemma l3.ir ii). Thus the claim follows if |Z/t(m)| = 1. So assume that |Lr(«)| > 
2. Suppose a' £ Lj{u) — {a}. Then y\a!a £ffl(T{) must hold as T[ is a refinement of T\ and 
y\a'a € M(Ti). Since the only {x,y,a',a}-tree that can simultaneously display the triplets 
y\a'a and xy\a is the tree with cherries x,y and a', a and that tree is equivalent with the tree 
T\{x,y,a',a} it follows thatxy|a' G ,^(r'), as claimed. 

Combined with the fact that T[ is a refinement of T\ it follows that T' is a refinement 
of T. Hence, Jz? coralls T which concludes the proof of the induction step in this case too 
and, thus, the proof of the theorem. 

□ 

Note that Theorem 15.11 immediately implies that for a non-empty set of cords of X 
to be a weak lasso for an X-tree T that is not the star-tree on X, it must be a covering 
of X, that is, X = \J Ae ^A. Also note that it immediately implies that a minimum size 
weak lasso for T must have £ v6VO(r) - (( |v(G( f ' v)l)l ) + \V(G(5?,v) s )\ x |V(G(j£f ,v)/)|) + 
L,eV°(7*)-v°(r)- |V V | cords. Thus, such a lasso has at most |V°(r)-| (™) + (m- l)|V°(r) - 
V°(r) _ | cords where m = max v6 yo( r )- |V V | and at least (/ — 1)|V°(7*)| cords where I = 
min veV o( T y \V V \. Note that these bounds are sharp in the case that all interior vertices of T 
have the same number k of children. In the former case T is such that no interior vertex of 
T that is not a parent of a pseudo-cherry is adjacent with a leaf of T. In the latter case T is 
the bearded caterpillar tree on X, that is, T is a (rooted) path and every vertex of that path 
is adjacent with k — 1 leaves but the end vertex of T that is not pr which has k children. 

Finally, note that as the example presented in Fig. [2] illustrates, for the theorem to 
hold the requirement that both proper edge-weightings are equidistant in the definition 
of a weak lasso cannot be dropped. More precisely for X = {a,b,c,d,e,f} and Jz? = 
{ab, be, bd, af, ae} the X-tree T pictured on the left of that figure (with the indicated edge- 
weighting (0 ignored for the moment) satisfies Properties (CI) and (C2) and the X-tree T' 
depicted on the right of that figure (again with the indicated edge-weighting ft)' ignored 
where < e < 1) is clearly not a refinement of T. Also ft) and ft)' are obviously proper and, 
in the case of ft), equidistant and (T, o) and (T', ft)') are ££ -isometric. 

6. A CHARACTERIZATION OF AN EQUIDISTANT LASSO 

In this section, we present a characterization of an equidistant lasso Jz? C rP) for an X- 
tree T in terms of the child-edge graphs associated to the interior vertices of T. To establish 
it, we require a further notation. Suppose T is an X-tree and v and w are two vertices of T. 
Then we denote by Ej (v, w) the set of all edges of T on the path from v to w. 

Theorem 6.1. Suppose T is an X-tree and _S? C (,) is a non-empty set of cords. Then the 
following are equivalent: 

(i) Jz? is an equidistant lasso for T. 

for every vertex v G V" (T), the graph G{J£ , v) contains at least one edge. 
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Figure 2. An example illustrating that for Theorem 15. II to hold the 
equidistance requirement in the definition of a weak lasso cannot be 
dropped (see text for details). 



Proof, (i) =>■ (ii): Suppose J>f is an equidistant lasso for T and assume for contradiction 
that there exists an interior vertex v of T for which G(^f,v) does not contain an edge. 
Note first that v = Pt cannot hold. Indeed, suppose ©i is a equidistant, proper edge- 
weighting for T and let e > be a sufficiently small real number. Consider the edge- 
weighting ©2 : E(T) — > R>o defined by putting, for all e G E(T), (02(e) = ©1 (e) — e if 
Pt € e and ©2(e) = CO\(e) else. Clearly, ©2 is an equidistant, proper edge-weighting for 
T distinct from ©1 and (T, C0\) and (T, ©2) are Jzf-isometric. Since, by assumption, _§f is 
an equidistant lasso for T it follows that ©1 = ©2; a contradiction. Thus, v / Pt and so 
there must exist a parent w € V(r) of v in 7\ Let ©1 denote again an equidistant, proper 
edge-weighting for 7\ Consider the map ©2 : E(T) — > R>o defined by putting 

{ffli(e) ifv£e, 
©i(e)-e ife = {v,w}, 
©i(e) + £ else, 

where e > is small enough. Clearly, ©2 is an equidistant, proper edge-weighting for T 
which is distinct from a>\. By construction, (T, ©1) and (T, ©2) are ££ -isometric and so 
©1 = ©2 must hold as Jz? is an equidistant lasso for T; a contradiction. 

(ii) => (i): Suppose that, for all v G V°(T), the graph G(^f,v) has at least one edge 
and assume for contradiction that ££ is not an equidistant lasso for T. Then there exist 
distinct equidistant, proper edge-weightings ©1 and ©2 for T such (T, ©1 ) and (T, ©2) are 
S£ -isometric. Thus, there must exist some edge e € E(T) such that ©1 (e) 7^ ©2(e). 

Assume without loss of generality that e = {v, vq} is such that with v being the parent 
of vo we have that ©1 (e 1 ) = ©2(e') holds for all edges e' of T that lie on a path from vo 
to a leaf of T contained in L(vo). Note that vo could be a leaf of T in which case such 
a path has length zero. Let vi, V2, ■ ■ ■ , V/, / > 1 denote the other children of v. Then, by 
assumption, G(Jzf,v) contains at least one edge and so there exist i,j G {0,/} distinct and 
leaves yi,yj G L(v) that are descendants of v; and Vj (or coincide with them), respectively, 
such that yiyj G ££ . Let z G L(vo) denote a leaf of T. By Property (El), we obtain 

©i(e)+ £ ©i(e') = D^ ) {v,z) = -D { T^ ) {y h y j ) = ^D {T ^ ) (y u y j ) 

e'€E T (v Q ,z) 

= D {TtCOl) (v,z) = £ ©2(c') + ®2(e)- 

e'e£r(r ,z) 

and so ©i(e) = ©2(e) follows by the choice of e; a contradiction. Thus, _£? must be an 
equidistant lasso for T. □ 
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Theorem l6. ll immediatelv implies that an equidistant lasso on X need not be a covering 
of X. Also, it implies that the size of a minimum equidistant lasso for an X-tree T is 
|V°(r)|. For the extreme cases that T is the star tree on X such a lasso has precisely one 
element and if T is binary such a lass o has \X \ — 1 elements a s any such tree is known to 



have |X| — 1 interior vertices (see e. g. lSemple and Steell d2003l) ) 



Theorem 15 . 1 1 combined with Theorem l6. ll immediatelv implies the following link be- 
tween equidistant and weak lassos. 

Corollary 6.2. Suppose T is an X-tree and .5? C ( 2 ) is a non-empty set of cords. Then Jzf 
is an equidistant lasso for T whenever it is a weak lasso for T. 

We remark in passing that dropping the requirement that the two edge-weightings have 
to be equidistant in the definition of an equidistant lasso gives rise to the definition of an 
edge-weight lasso for an X-tree. However it is easy to see that Theorem l6.1l does not hold 
with equidistant lasso replaced by edge-weight lasso. 

7. A CHARACTERIZATION OF A TOPOLOGICAL LASSO 

In this section, we prove the companion result for when a set of cords is a topological 
lasso for an X-tree T in terms of the child-edge graphs associated to the interior vertices of 
T. We start again with some more notation. 

Suppose that T is an X-tree and that v G V°(T) but not the root of T. Then we denote 
the L(v)-tree obtained from T by deleting the parent edge of v by T L r v y Now suppose that 
Y C X is such that there exists some v G V"(T) such that Y = L(v). Then we denote the 
root of Ty by p (TV). 

Theorem 7.1. Suppose T is an X-tree and C is a non-empty set of cords. Then the 
following are equivalent: 

(i) «S? is a topological lasso for T. 

(ii) for every vertex v G V°(T), the graph G(J£ ', v) is a clique. 

Proof, (i) (ii): Suppose that Jz? is a topological lasso for T and assume for contradiction 
that there exists some vertex v G V°(T) such that G(j£f ,v) is not a clique. Then there exist 
child edges e,e' G E(T) of v such that {e,e'} g" E v . Let v e and v e i denote the children of v 
incident with e and e', respectively, and let CO denote an equidistant, proper edge-weighting 
of T. We distinguish the cases that (i) v is the parent of a pseudo-cherry of T and that (ii) 
v G V"{T)-. 

Case (i): Assume that v is the parent of a pseudo-cherry of T. Then since every topo- 
logical lasso for T is in particular a weak lasso for T, Theorem |5. ll implies that G(Jff , v) is 
connected and that, in addition to x := v e and y :=v e i, there must exists a further child of v 
(that is a leaf of T). Note that since co is equidistant, we must have co(e) — ffl(e'). 

Let T' denote the X-tree obtained from T by subdividing the edge e' by a vertex w that 
is not already contained in V(T), adding the edge {x, w} and deleting the edge e. Clearly, 
T and T' are not equivalent. Let < £ < ©(e) and consider edge-weighting 

((0(f) ifwG7, 
(o':E(T')^R> : /-> < co(f)-e if / G {{w,x},{w,y}}, 
[e else. 

Clearly, co' is equidistant and proper and it is straight forward to see that (T 1 ,co') and (T, co) 
are ££ -isometric. Since «Sf is a topological lasso for T it follows that T and T' must be 
equivalent, a contradiction. 
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Case (ii): Assume that v G V°(T)~. Then since every topological lasso for T is in 
particular a weak lasso for T, Theorem 15.11 implies that G(Jzf,v) is rich. But then v e and 
Vgi must be leaves of T. With x = v e and y = iy we obtain a contradiction using the same 
arguments as in Case (i). 

(ii) =>■ (i): Suppose that, for every vertex v G V°(T), the graph G(_Sf,v) is a clique and 
assume that T' is an X-tree and ft) and ft)' are equidistant, proper edge-weightings for 7 and 
T', respectively, such that (T, ft)) and (T, ft)') are Jzf -isometric. If 7" is the star-tree on X 
then Jz? must necessarily be a topological lasso for T. So assume that T is not the star-tree 
on X. Then, by Theorem l5.ll Jz? is a weak lasso for 7" and so 7*' must be a refinement of 

r. 

We next show that T and T' are in fact equivalent. Assume for contradiction that T and 
T' are not equivalent. Then there must exist a non-empty subset Y CX of leaves of T such 
that the subtree Ty of T with leaf set Y is equivalent with the subtree Ty of 7*' with leaf set Y 
but the subtrees of T and T' with root the parents of p (Ty) and p (Ty), respectively, are not. 
Let w denote the parent of p(Ty) in T and w' the parent of p{T Y ) in T', Then there must 
exist some z G L(w') -L(p(T Y )) and distinct x,y G L(p(T Y )) = Y such that xy|z G M(T') 
and xy|z &(T), Hence, p{Ty) must lie on the path from x to y in 7\ Combined with 
the fact that T' is a refinement of 7, it follows that T\^ xy ^ is the star-tree onZ:= {x,y,z} 
whose unique interior vertex is p (Ty). Since ft) is equidistant, we obtain 

(5) D {Tm) (x 7 y) = D {Tm) (x,z) = D [Tm) (z,y). 

Lete x ,e y ,e z &E(T) denote the child edges of p(Ty) that are crossed by a path from p(Ty) 
to x, y, and z, respectively, and let v fj denote the child of p(Ty) incident with e s , for all 
s G Z. Since, by assumption, G(Jf,p(Ty)) is a clique there must exist leaves a G L(v ex ), 
b G L(v ev ), and c G L(v e .) such that ab,bc,ca G JSf. By the same reason, it follows in view 
of Lemma [3TI that D(r,ea) (p,?) = D{p ,m')(Pil) must hold f° r an y two elements p,q G Z 
distinct. Combined with Equality (0, we obtain Dtjt m i)(p,q) — D(r' )£0 ') (/?,<?), for any two 
such elements p and ^. But then T'\z must be the star-tree on Z and so xy\z £ M(T'), a 
contradiction. □ 

Note that Theorem 17.11 immediately implies that a topological lasso _£f C (,) must be 
a covering of X. Also note that it implies that if ££ is a topological lasso for an X-tree T 
then .if must contain at least Y<veV°(T) ( 2 ) cords and that «5f = (^) must hold in case T is 
the star-tree on X. Finally, note that as the example presented in Fig. |3]shows, the require- 
ment that the two proper edge-weightings in the definition of a topological lasso must be 
equidistant cannot be dropped. More precisely forX = {a,b,c,d} and _£? = {ab,cd,ad} 
the X-tree T pictured on the left of that figure (with the indicated edge-weighting ft) ig- 
nored for the moment) satisfies Properties (CI) and (C2) is not equivalent with the X-tree 
T' depicted on the right of that figure (again with the indicated edge-weighting ft)' ignored 
where < e < 1). Also ft) and 0' are clearly proper and, in the case of ft), also equidistant 
and (T*, ft)) and (T 1 , ft)') are ^f-isometric. 

Combining Proposition ^. H and Theorem l7.ll we obtain the following corollary. 

Corollary 7.2. Suppose that T is an X-tree. Then every topological lasso for T must 
also be an edge-weight lasso for T. Moreover, ifT is binary and Jz? C (,) then _Sf is a 
topological lasso for T if and only if Jz? is an equidistant lasso for T. In particular, every 
topological/ equidistant lasso for T is also a strong lasso for T in this case. 
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a b c d abed 



Figure 3 . An example showing that for Theorem 17.11 to hold the 
"equidistant" requirement in the definition of a topological lasso cannot 
be dropped (see text for details). 



Note however that for Jz? C a set of cords and T an X-tree it is possible that Jz? 
is simultaneously an edge-weight and a topological lasso for T but T is not binary. We 
present such an example in the next section. 



8. Examples of lassos 



In this section we apply our findings to two types of constructio ns of sets of cords of 
X. Both of them were originally introduced in Dress et al ( 2012bl) for the case of edge- 



weighted, unrooted, phylogenetic trees with leaf set X where an edge-weighting for such a 
tree is defined as in the case of an X-tiee. 

Assume for the remainder of this section that T is an X-tree. Then the first example 



relies on the notion of a circular ordering (of the leaf set) of T (see e.g . Semple and Steell 
d2003l) for further details on such orderings). Following Dress et a! ( 20 12b ) a circular 



ordering (ofX) of T is a cyclic permutation a of X such that the following holds. There 
exists a planar embedding of T such that, for every the leaf that is encountered 

after x when traversing T in that embedding in say, a counter-clockwise fashion, is the leaf 
<x(jc). For example, (a,b,c,d) is a circular ordering of T the {a,b,c,d}-tiee depicted in 
Figure[3la). 

Let (x\,X2, ■ ■ ■ ,x n ) denote a circular ordering of the leaves of T where n := \X\ and put 
x n +i = x\ . Then since for every interior vertex v of T there exists some i 6 { 1 , . . . , n} such 
that v lies on the path from x, to Jtj+i, Theorem l6. 1 l implies that the set 

££ = {%iX i+ \ : 1 </<«} 

is an equidistant lasso for T. However, Jz? is clearly not an equidistant lasso of minimal 
size. In view of Theorem 17. II «5f is a topological lasso for T if and only if every interior 
vertex v £ V"(T) has degree three (except possibly the root pj which might also have 
degree two) as in that case G(_Sf,w) is a complete graph for all w <E V"(T). Thus, Jz? is 
also a strong lasso for such X-trees. In view of Theorem |5.1| Jz? is a weak lasso for T if and 
only if every vertex v £ V°(T)~ has degree three (except possibly the root pj which might 
also have degree two) as in that case G(Jz?,w) is rich for all w € V°(T)~ and connected for 

aiiwey (r)-y°(r)-. 

The next construction re lies on the notion of a bipartition {A,B} of X and was intro- 
duced in Dress et a 3 d2012bl) where it was shown that the set 



AVB := {ab S ( ) : a e A and b e B} 
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is a topological lasscQ for an unrooted phylogenetic tree T' with leaf set X if and only 
if, for every 2-subset c of X whose elements have that same parent in T', we have that 
AncV=07^Bnc. Note that this implies in particular that for an unrooted phylogenetic tree 
on X to be topologically lassoed by A VB, every interior vertex of T can be adjacent with 
at most two leaves. Thus every pseudo-cherry of T' (defined as in the case of an X-tree) 
must be a cherry of T. 

Defining for an unrooted phylogenetic tree T' on X a set S£ C (*) of cords to be an 
edge-weight lasso as in the case of an X-tree but again with the requirement "equidistant" 
on the proper edge-weightings removed, it is not difficult to see that A V B is not an edge- 
weight lasso for T 1 . Also it is straight forward to see that any two proper edge-weightings co 
and co' for T' such that Dm >(B ) (a,b) ~ D(ji m i^(a,b) holds for all ab £ Jzf must coincide on 
the interior edges of T' where for a proper edge-weighting a of T' we denote the induced 
distance on V(T') also by D( T i a y 

In the case ofX-trees the situation changes in so far that if T is not the star-tree onl and 
{A,B} is such that every pseudo-cherry of T contains elements from both A and B then, in 
view of Theorem 15. II A V B must be a weak lasso for T and thus, by Corollary 16.21 also 
an equidistant lasso for T, In view of Theorem 17. 11 A VB is not a topological lasso for T 
unless every interior vertex v G V"(T) of T is incident with at most two leaves of T and, 
if v is incident with two leaves, then one is contained in A and the other in B as otherwise 
G(A V B,v) would not be a clique. Since for such X-trees T we have, for all v € V°(T), 
that G(Jzf , v) contains at least one edge it follows that A V B is a strong lasso for T. 

If T is the star-tree on X then A VB is a weak lasso for T. Also, A VB is an equidistant 
lasso for T as G(j£,pr) contains at least one edge but it is not a topological lasso for T as 

9. Conclusion 

In the form of investigating when a set of cords of a finite set X of size at least three is 
an equidistant/topological/weak/strong lasso for an X-tree, we have addressed the topical 
problem of when a set of partial distances for a set of individuals within a variety uniquely 
determines a dendogram on those individuals. Such structures are commonly constructed 
as part of a phenetic clustering step within a genomewide association study to better un- 
derstand the link between phenotypic and a genotypic variation within a variety. For T 
an X-tree and Jzf C (~) a set of cords, we present characterizations for when Jzf is an 
equidistant/weak/tropological lasso for T in terms of the structure of the child-edge graphs 
associated to the interior vertices of T. As immediate consequences, our characterizations 
allow us to not only shed light into the problem of when two of the above types of lassos 
coincide but also into the size of minimum size equidistant/topological/weak lassos. 

Despite these encouraging results a number of open questions remain. For example, the 
characterizations above require knowledge of the structure of T in the form of the child- 
edge graphs associated to the interior vertices of T. Thus, is it possible to characterize or 
at least understand lassos without this structural insight into T, A potential candidate for 
this might be the graph r(j£?) associated to Jzf whose vertex set is X an d whose edge set is 



Jzf. The underlying rational for this is that for \X\ > 4, it was shown in dDress et all 1201 2b , 
Theorem 1) that for Jzf to be a topological lasso for an unrooted phylogenetic tree T on 
X the graph T(Jzf ) has to be connected and for Jzf to be an edge-weight lasso for T it has 



^The definition of a topological lasso for an unrooted phylogenetic tree on X is the same as that of a topological 
lasso for an X-tree but with the requirement dropped that the two proper edge-weightings mentioned in that 
definition are equidistant. 
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to be strongly non-bipartite (where a graph G is said to be strongly non-bipartite if every 
connected component of G is not bipartite). Also for Jzf as constructed in the first example 
in Section[8]the graph r(_S? ) is connected. 

To overco me the potential loss of inf ormation in distance based phylogenetic tree re- 
construction, iPachter and Speven ( 120041) proposed using k-dissimilarities, k > 3, on X 
rather than 2-similarities as is t he case when reco nstructing e dge-weighted phylogeneti c 
trees from dist a nces (see also dFelsenstein , 2003 , p. 176) and Herrmann et all din pressh : 



Warrens ( 2010h : Deza and Rosenberg d200C ) and the references therein for recent work on 



such objects which are sometimes also called A:-way similarities, k-way distances, and k- 
semimetrics). It would be interesting to know what can be said about lassoing and coralling 
of X-trees within this more general framework. 
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